Overview

Dataset Statistics

Number of Variables 29
Number of Rows 2240
Missing Cells 24
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 753.9 KB
Average Row Size in Memory 344.6 B
Variable Types
  • Numerical: 15
  • Categorical: 13
  • DateTime: 1

Dataset Insights

MntFruits and MntSweetProducts have similar distributions Similar Distribution
Income has 24 (1.07%) missing values Missing
Income is skewed Skewed
MntWines is skewed Skewed
MntFruits is skewed Skewed
MntMeatProducts is skewed Skewed
MntFishProducts is skewed Skewed
MntSweetProducts is skewed Skewed
MntGoldProds is skewed Skewed
NumDealsPurchases is skewed Skewed
NumWebPurchases is skewed Skewed
NumCatalogPurchases is skewed Skewed
NumStorePurchases is skewed Skewed
NumWebVisitsMonth is skewed Skewed
Z_CostContact has constant value "3" Constant
Z_Revenue has constant value "11" Constant
Kidhome has constant length 1 Constant Length
Teenhome has constant length 1 Constant Length
AcceptedCmp3 has constant length 1 Constant Length
AcceptedCmp4 has constant length 1 Constant Length
AcceptedCmp5 has constant length 1 Constant Length
AcceptedCmp1 has constant length 1 Constant Length
AcceptedCmp2 has constant length 1 Constant Length
Complain has constant length 1 Constant Length
Z_CostContact has constant length 1 Constant Length
Z_Revenue has constant length 2 Constant Length
Response has constant length 1 Constant Length
MntFruits has 400 (17.86%) zeros Zeros
MntFishProducts has 384 (17.14%) zeros Zeros
MntSweetProducts has 419 (18.71%) zeros Zeros
NumCatalogPurchases has 586 (26.16%) zeros Zeros
  • 1
  • 2
  • 3
  • 4

Variables


ID

numerical

Approximate Distinct Count 2240
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 5592.1598
Minimum 0
Maximum 11191
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ID is skewed right (γ1 = 0.0398)

Quantile Statistics

Minimum 0
5-th Percentile 576.85
Q1 2828.25
Median 5458.5
Q3 8427.75
95-th Percentile 10675.05
Maximum 11191
Range 11191
IQR 5599.5

Descriptive Statistics

Mean 5592.1598
Standard Deviation 3246.6622
Variance 1.0541e+07
Sum 1.2526e+07
Skewness 0.03981
Kurtosis -1.1901
Coefficient of Variation 0.5806

Year_Birth

numerical

Approximate Distinct Count 59
Approximate Unique (%) 2.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 1968.8058
Minimum 1893
Maximum 1996
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Year_Birth is skewed left (γ1 = -0.3497)

Quantile Statistics

Minimum 1893
5-th Percentile 1950
Q1 1959
Median 1970
Q3 1977
95-th Percentile 1988
Maximum 1996
Range 103
IQR 18

Descriptive Statistics

Mean 1968.8058
Standard Deviation 11.9841
Variance 143.6179
Sum 4.4101e+06
Skewness -0.3497
Kurtosis 0.7132
Coefficient of Variation 0.006087
  • Year_Birth has 3 outliers

Education

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 162442
  • The largest value (Graduation) is over 2.32 times larger than the second largest value (PhD)

Length

Mean 7.5187
Standard Deviation 2.8436
Median 10
Minimum 3
Maximum 10

Sample

1st row Graduation
2nd row Graduation
3rd row Graduation
4th row Graduation
5th row PhD

Letter

Count 16436
Lowercase Letter 13710
Space Separator 203
Uppercase Letter 2726
Dash Punctuation 0
Decimal Number 203
  • The top 2 categories (Graduation, PhD) take over 50.0%
  • The largest value (graduation) is over 2.32 times larger than the second largest value (phd)

Marital_Status

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 161444

Length

Mean 7.0732
Standard Deviation 0.8506
Median 7
Minimum 4
Maximum 8

Sample

1st row Single
2nd row Single
3rd row Together
4th row Together
5th row Married

Letter

Count 15844
Lowercase Letter 13598
Space Separator 0
Uppercase Letter 2246
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Married, Together) take over 50.0%

Income

numerical

Approximate Distinct Count 1974
Approximate Unique (%) 89.1%
Missing 24
Missing (%) 1.1%
Infinite 0
Infinite (%) 0.0%
Memory Size 35456
Mean 52247.2514
Minimum 1730
Maximum 666666
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Income is skewed right (γ1 = 6.7589)

Quantile Statistics

Minimum 1730
5-th Percentile 18985.5
Q1 35303
Median 51381.5
Q3 68522
95-th Percentile 84130
Maximum 666666
Range 664936
IQR 33219

Descriptive Statistics

Mean 52247.2514
Standard Deviation 25173.0767
Variance 6.3368e+08
Sum 1.1578e+08
Skewness 6.7589
Kurtosis 159.274
Coefficient of Variation 0.4818
  • Income is not normally distributed (p-value 2.634936839657269e-10)
  • Income has 8 outliers

Kidhome

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 1
3rd row 0
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • Kidhome has words of constant length

Teenhome

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 1
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • Teenhome has words of constant length

Dt_Customer

datetime

Distinct Count 663.3458
Approximate Unique (%) 29.6%
Missing 0
Missing (%) 0.0%
Memory Size 18048
Minimum 2012-07-30 00:00:00
Maximum 2014-06-29 00:00:00

Recency

numerical

Approximate Distinct Count 100
Approximate Unique (%) 4.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 49.1094
Minimum 0
Maximum 99
Zeros 28
Zeros (%) 1.2%
Negatives 0
Negatives (%) 0.0%
  • Recency is skewed left (γ1 = -0.002)

Quantile Statistics

Minimum 0
5-th Percentile 4
Q1 24
Median 49
Q3 74
95-th Percentile 94
Maximum 99
Range 99
IQR 50

Descriptive Statistics

Mean 49.1094
Standard Deviation 28.9625
Variance 838.8237
Sum 110005
Skewness -0.001985
Kurtosis -1.2019
Coefficient of Variation 0.5898

MntWines

numerical

Approximate Distinct Count 776
Approximate Unique (%) 34.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 303.9357
Minimum 0
Maximum 1493
Zeros 13
Zeros (%) 0.6%
Negatives 0
Negatives (%) 0.0%
  • MntWines is skewed right (γ1 = 1.175)

Quantile Statistics

Minimum 0
5-th Percentile 3
Q1 23.75
Median 173.5
Q3 504.25
95-th Percentile 1000
Maximum 1493
Range 1493
IQR 480.5

Descriptive Statistics

Mean 303.9357
Standard Deviation 336.5974
Variance 113297.8047
Sum 680816
Skewness 1.175
Kurtosis 0.5947
Coefficient of Variation 1.1075
  • MntWines is not normally distributed (p-value 3.9151042950064325e-22)
  • MntWines has 35 outliers

MntFruits

numerical

Approximate Distinct Count 158
Approximate Unique (%) 7.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 26.3022
Minimum 0
Maximum 199
Zeros 400
Zeros (%) 17.9%
Negatives 0
Negatives (%) 0.0%
  • MntFruits is skewed right (γ1 = 2.1007)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 1
Median 8
Q3 33
95-th Percentile 123
Maximum 199
Range 199
IQR 32

Descriptive Statistics

Mean 26.3022
Standard Deviation 39.7734
Variance 1581.926
Sum 58917
Skewness 2.1007
Kurtosis 4.0393
Coefficient of Variation 1.5122
  • MntFruits is not normally distributed (p-value 2.914954807404711e-21)
  • MntFruits has 227 outliers

MntMeatProducts

numerical

Approximate Distinct Count 558
Approximate Unique (%) 24.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 166.95
Minimum 0
Maximum 1725
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • MntMeatProducts is skewed right (γ1 = 2.0818)

Quantile Statistics

Minimum 0
5-th Percentile 4
Q1 16
Median 67
Q3 232
95-th Percentile 687.1
Maximum 1725
Range 1725
IQR 216

Descriptive Statistics

Mean 166.95
Standard Deviation 225.7154
Variance 50947.4294
Sum 373968
Skewness 2.0818
Kurtosis 5.5017
Coefficient of Variation 1.352
  • MntMeatProducts is not normally distributed (p-value 6.243197068981278e-22)
  • MntMeatProducts has 175 outliers

MntFishProducts

numerical

Approximate Distinct Count 182
Approximate Unique (%) 8.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 37.5254
Minimum 0
Maximum 259
Zeros 384
Zeros (%) 17.1%
Negatives 0
Negatives (%) 0.0%
  • MntFishProducts is skewed right (γ1 = 1.9185)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 3
Median 12
Q3 50
95-th Percentile 168.05
Maximum 259
Range 259
IQR 47

Descriptive Statistics

Mean 37.5254
Standard Deviation 54.629
Variance 2984.3254
Sum 84057
Skewness 1.9185
Kurtosis 3.0869
Coefficient of Variation 1.4558
  • MntFishProducts is not normally distributed (p-value 1.4767151972203193e-21)
  • MntFishProducts has 223 outliers

MntSweetProducts

numerical

Approximate Distinct Count 177
Approximate Unique (%) 7.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 27.0629
Minimum 0
Maximum 263
Zeros 419
Zeros (%) 18.7%
Negatives 0
Negatives (%) 0.0%
  • MntSweetProducts is skewed right (γ1 = 2.1347)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 1
Median 8
Q3 33
95-th Percentile 126
Maximum 263
Range 263
IQR 32

Descriptive Statistics

Mean 27.0629
Standard Deviation 41.2805
Variance 1704.0796
Sum 60621
Skewness 2.1347
Kurtosis 4.3641
Coefficient of Variation 1.5254
  • MntSweetProducts is not normally distributed (p-value 1.344673824367651e-22)
  • MntSweetProducts has 248 outliers

MntGoldProds

numerical

Approximate Distinct Count 213
Approximate Unique (%) 9.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 44.0219
Minimum 0
Maximum 362
Zeros 61
Zeros (%) 2.7%
Negatives 0
Negatives (%) 0.0%
  • MntGoldProds is skewed right (γ1 = 1.8848)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 9
Median 24
Q3 56
95-th Percentile 165.05
Maximum 362
Range 362
IQR 47

Descriptive Statistics

Mean 44.0219
Standard Deviation 52.1674
Variance 2721.4417
Sum 98609
Skewness 1.8848
Kurtosis 3.5411
Coefficient of Variation 1.185
  • MntGoldProds is not normally distributed (p-value 2.5540182116988908e-14)
  • MntGoldProds has 207 outliers

NumDealsPurchases

numerical

Approximate Distinct Count 15
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 2.325
Minimum 0
Maximum 15
Zeros 46
Zeros (%) 2.1%
Negatives 0
Negatives (%) 0.0%
  • NumDealsPurchases is skewed right (γ1 = 2.4169)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 1
Median 2
Q3 3
95-th Percentile 6
Maximum 15
Range 15
IQR 2

Descriptive Statistics

Mean 2.325
Standard Deviation 1.9322
Variance 3.7335
Sum 5208
Skewness 2.4169
Kurtosis 8.9143
Coefficient of Variation 0.8311
  • NumDealsPurchases is not normally distributed (p-value 4.651298587763566e-19)
  • NumDealsPurchases has 86 outliers

NumWebPurchases

numerical

Approximate Distinct Count 15
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 4.0848
Minimum 0
Maximum 27
Zeros 49
Zeros (%) 2.2%
Negatives 0
Negatives (%) 0.0%
  • NumWebPurchases is skewed right (γ1 = 1.3819)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 2
Median 4
Q3 6
95-th Percentile 9
Maximum 27
Range 27
IQR 4

Descriptive Statistics

Mean 4.0848
Standard Deviation 2.7787
Variance 7.7213
Sum 9150
Skewness 1.3819
Kurtosis 5.6877
Coefficient of Variation 0.6803
  • NumWebPurchases is not normally distributed (p-value 1.9282210382730392e-08)
  • NumWebPurchases has 4 outliers

NumCatalogPurchases

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 2.6621
Minimum 0
Maximum 28
Zeros 586
Zeros (%) 26.2%
Negatives 0
Negatives (%) 0.0%
  • NumCatalogPurchases is skewed right (γ1 = 1.8797)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 2
Q3 4
95-th Percentile 9
Maximum 28
Range 28
IQR 4

Descriptive Statistics

Mean 2.6621
Standard Deviation 2.9231
Variance 8.5445
Sum 5963
Skewness 1.8797
Kurtosis 8.0268
Coefficient of Variation 1.0981
  • NumCatalogPurchases is not normally distributed (p-value 6.141316255067897e-14)
  • NumCatalogPurchases has 23 outliers

NumStorePurchases

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 5.7902
Minimum 0
Maximum 13
Zeros 15
Zeros (%) 0.7%
Negatives 0
Negatives (%) 0.0%
  • NumStorePurchases is skewed right (γ1 = 0.7018)

Quantile Statistics

Minimum 0
5-th Percentile 2
Q1 3
Median 5
Q3 8
95-th Percentile 12
Maximum 13
Range 13
IQR 5

Descriptive Statistics

Mean 5.7902
Standard Deviation 3.251
Variance 10.5687
Sum 12970
Skewness 0.7018
Kurtosis -0.6233
Coefficient of Variation 0.5615
  • NumStorePurchases is not normally distributed (p-value 1.684625524418003e-11)

NumWebVisitsMonth

numerical

Approximate Distinct Count 16
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 35840
Mean 5.3165
Minimum 0
Maximum 20
Zeros 11
Zeros (%) 0.5%
Negatives 0
Negatives (%) 0.0%
  • NumWebVisitsMonth is skewed right (γ1 = 0.2078)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 3
Median 6
Q3 7
95-th Percentile 8
Maximum 20
Range 20
IQR 4

Descriptive Statistics

Mean 5.3165
Standard Deviation 2.4266
Variance 5.8886
Sum 11909
Skewness 0.2078
Kurtosis 1.8149
Coefficient of Variation 0.4564
  • NumWebVisitsMonth is not normally distributed (p-value 4.682675946827966e-08)
  • NumWebVisitsMonth has 8 outliers

AcceptedCmp3

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840
  • The largest value (0) is over 12.74 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 12.74 times larger than the second largest value (1)
  • AcceptedCmp3 has words of constant length

AcceptedCmp4

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840
  • The largest value (0) is over 12.41 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 12.41 times larger than the second largest value (1)
  • AcceptedCmp4 has words of constant length

AcceptedCmp5

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840
  • The largest value (0) is over 12.74 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 12.74 times larger than the second largest value (1)
  • AcceptedCmp5 has words of constant length

AcceptedCmp1

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840
  • The largest value (0) is over 14.56 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 14.56 times larger than the second largest value (1)
  • AcceptedCmp1 has words of constant length

AcceptedCmp2

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840
  • The largest value (0) is over 73.67 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 73.67 times larger than the second largest value (1)
  • AcceptedCmp2 has words of constant length

Complain

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840
  • The largest value (0) is over 105.67 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 105.67 times larger than the second largest value (1)
  • Complain has words of constant length

Z_CostContact

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 147840

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 3
2nd row 3
3rd row 3
4th row 3
5th row 3

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • Z_CostContact has words of constant length

Z_Revenue

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 150080

Length

Mean 2
Standard Deviation 0
Median 2
Minimum 2
Maximum 2

Sample

1st row 11
2nd row 11
3rd row 11
4th row 11
5th row 11

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 4480
  • Z_Revenue has words of constant length

Response

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 147840
  • The largest value (0) is over 5.71 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2240
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 5.71 times larger than the second largest value (1)
  • Response has words of constant length

Interactions

Correlations

Missing Values